Assessing the Performance of Different Time Series Classification Methods

نویسندگان

  • Xiaojin Li
  • Lexiang Ye
  • Eamonn Keogh
چکیده

Classification of time series has been attracting great interest over the past decade. Recent empirical evidence has strongly suggested that the simple nearest neighbor algorithm is very difficult to beat for most time series problems. While this may be considered good news, given the simplicity of implementing the nearest neighbor algorithm, there are some negative consequences of this. First, the nearest neighbor algorithm requires storing and searching the entire dataset, resulting in a time and space complexity that limits its applicability, especially on resource-limited sensors. Second, beyond mere classification accuracy, we often wish to gain some insight into the data. In this work we introduce a new time series primitive, time series shapelets, which addresses these limitations. Informally, shapelets are time series subsequences which are in some sense maximally representative of a class. As we shall show with extensive empirical evaluations in diverse domains, algorithms based on the time series shapelet primitives can be interpretable, more accurate and significantly faster than state-of-the-art classifiers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of the croissant production line with reparable machines

In this study, the analytical probability models for an automated serial production system, bufferless that consists of n-machines in series with common transfer mechanism and control system was developed. Both time to failure and time to repair a failure are assumed to follow exponential distribution. Applying those models, the effect of system parameters on system performance in actu...

متن کامل

Time Series Analysis of Non-Oil Export Demand and Economic Performance in Nigeria

T his study examines the impact of non-oil export demand on economic performance in Nigeria using annual time series data between 1975 and 2013. The study tests for the unit root and co-integration to determine the time series properties of our variables before using Vector Error Correction (VEC) model for both short- and long- run estimates and possible policy inferences. The result...

متن کامل

GDOP Classification and Approximation by Implementation of Time Delay Neural Network Method for Low-Cost GPS Receivers

Geometric Dilution of Precision (GDOP) is a coefficient for constellations of Global Positioning System (GPS) satellites. These satellites are organized geometrically. Traditionally, GPS GDOP computation is based on the inversion matrix with complicated measurement equations. A new strategy for calculation of GPS GDOP is construction of time series problem; it employs machine learning and artif...

متن کامل

Modeling and prediction of time-series of monthly copper prices

One of the main tasks to analyze and design a mining system is predicting the behavior exhibited by prices in the future. In this paper, the applications of different prediction methods are evaluated in econometrics and financial management fields, such as ARIMA, TGARCH, and stochastic differential equations, for the time-series of monthly copper prices. Moreover, the performance of these metho...

متن کامل

A NEW APPROACH BASED ON OPTIMIZATION OF RATIO FOR SEASONAL FUZZY TIME SERIES

In recent years, many studies have been done on forecasting fuzzy time series. First-order fuzzy time series forecasting methods with first-order lagged variables and high-order fuzzy time series forecasting methods with consecutive lagged variables constitute the considerable part of these studies. However, these methods are not effective in forecasting fuzzy time series which contain seasonal...

متن کامل

A Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach

In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015